Dimension Reduction Based on Canonical Correlation
نویسندگان
چکیده
Dimension reduction is helpful and often necessary in exploring nonlinear or nonparametric regression structures with a large number of predictors. We consider using the canonical variables from the design space whose correlations with a spline basis in the response space are significant. The method can be viewed as a variant of sliced inverse regression (SIR) with simple slicing replaced by Bspline basis functions. The asymptotic distribution theory we develop extends to weakly dependent stationary sequences and enables us to consider asymptotic tests that are useful in determining the number of significant dimensions for modeling. We compare several tests for dimensionality and make specific recommendations for dimension selection based on our theoretical and empirical studies. These tests apply to any form of SIR. The methodology and some of the practical issues are illustrated through a tuition study of American colleges.
منابع مشابه
Analysis of Correlation Based Dimension Reduction Methods
Dimension reduction is an important topic in data mining and machine learning. Especially dimension reduction combined with feature fusion is an effective preprocessing step when the data are described by multiple feature sets. Canonical Correlation Analysis (CCA) and Discriminative Canonical Correlation Analysis (DCCA) are feature fusion methods based on correlation. However, they are differen...
متن کاملMethods of Canonical Analysis for Functional Data
We consider estimates for functional canonical correlations and canonical weight functions. Four computational methods for the estimation of functional canonical correlation and canonical weight functions are proposed and compared, including one which is a slight variation of the spline method proposed by Leurgans, Moyeed and Silverman (1993). We propose dimension reduction and dimension augmen...
متن کاملDimension reduction for individual ica to decompose FMRI during real-world experiences: principal component analysis vs. canonical correlation analysis
Group independent component analysis (ICA) with special assumptions is often used for analyzing functional magnetic resonance imaging (fMRI) data. Before ICA, dimension reduction is applied to separate signal and noise subspaces. For analyzing noisy fMRI data of individual participants in free-listening to naturalistic and long music, we applied individual ICA and therefore avoided the assumpti...
متن کاملA Latent Variable Model for Two-Dimensional Canonical Correlation Analysis and its Variational Inference
Describing the dimension reduction (DR) techniques by means of probabilistic models has recently been given special attention. Probabilistic models, in addition to a better interpretability of the DR methods, provide a framework for further extensions of such algorithms. One of the new approaches to the probabilistic DR methods is to preserving the internal structure of data. It is meant that i...
متن کاملAsymptotic expansions of test statistics for dimensionality and additional information in canonical correlation analysis when the dimension is large
This paper examines asymptotic expansions of test statistics for dimensionality and additional information in canonical correlation analysis based on a sample of size N = n+1 on two sets of variables, i.e., xu; p1×1 and xv; p2×1. These problems are related to dimension reduction. The asymptotic approximations of the statistics have been studied extensively when dimensions p1 and p2 are fixed an...
متن کامل